On Weak Regret Analysis for Dueling Bandits

Neural Information Processing Systems

When the optimality gap is negligible, we propose another algorithm that outperforms our first algorithm, highlighting the subtlety of this dueling bandit problem.